The limits and robustness of reinforcement learning in Lewis signalling games

نویسندگان

  • David Catteeuw
  • Bernard Manderick
چکیده

Lewis signaling games are a standard model to study the emergence of language. We introduce win-stay/lose-inaction, a random process that only updates behavior on success and never deviates from what was once successful, prove that it always ends up in a state of optimal communication in all Lewis signaling games, and predict the number of interactions it needs to do so: N3 interactions for Lewis signaling games with N equiprobable types. We show three reinforcement learning algorithms (Roth-Erev learning, Q-learning, and Learning Automata) that can imitate win-stay/lose-inaction and can even cope with errors in Lewis signaling games.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hybrid learning in signalling games

Lewis-Skyrms signaling games (Lewis 1969; Skyrms 2010) have been studied under a variety of low-rationality learning dynamics (Barrett 2006; Barrett and Zollman 2009; Huttegger, Skyrms, Smead, and Zollman 2010; Huttegger, Skyrms, Tarrès, and Wagner 2014; Huttegger, Skyrms, and Zollman 2014). Reinforcement dynamics are stable but slow and prone to evolving suboptimal signaling conventions. A low...

متن کامل

The Limits of Reinforcement Learning in Lewis Signaling Games

We study how communication systems can emerge through repeated interaction between two individuals. We apply three reinforcement learning algorithms (Roth-Erev learning, Learning Automata, and Q-Learning) to the repeated Lewis signaling game, a game theoretic model of a communication problem. Our experiments show that each of these algorithms always reach a state of optimal communication even f...

متن کامل

The Evolution of Coding in Signaling Games

Signaling games with reinforcement learning have been used to model the evolution of term languages (Lewis 1969, Convention. Cambridge, MA: Harvard University Press; Skyrms 2006, “Signals” Presidential Address. Philosophy of Science Association for PSA). In this article, syntactic games, extensions of David Lewis’s original sender– receiver game, are used to illustrate how a language that explo...

متن کامل

Utilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs

Multi agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for Multi agent Reinforcement Learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDP is proposed. In the proposed algorithm, MMDP ...

متن کامل

The Impact of Playing Word Games on Young Iranian EFL Learners’ Vocabulary Learning and Retention

Acquiring adequate vocabulary in a foreign language is very important but often difficult. Considering the importance of learner’s vocabulary learning and retention, the present study aimed at examining the impact of playing word games on young Iranian EFL learners’ vocabulary learning and retention at Irandoostan language Institute in Tabriz. To that end, 50 female learners at the age range of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Connect. Sci.

دوره 26  شماره 

صفحات  -

تاریخ انتشار 2014